Robust feature vector compression algorithm for distributed speech recognition
نویسندگان
چکیده
In this paper we propose an algorithm for efficient compression of feature extracted parameters used in speech recognition. The algorithm provides a compression ratio of roughly 1:10 and causes negligible or no loss in recognition performance. It is also shown to be robust against enviromental noise. Combined with an appropriate framing structure, a complete system is obtained, which can be used for implementing speech recognition applications e.g. in a cellular mobile environment. The system achieves a gross bitrate as low as 4200 bps.
منابع مشابه
Improving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملImproving of Feature Selection in Speech Emotion Recognition Based-on Hybrid Evolutionary Algorithms
One of the important issues in speech emotion recognizing is selecting of appropriate feature sets in order to improve the detection rate and classification accuracy. In last studies researchers tried to select the appropriate features for classification by using the selecting and reducing the space of features methods, such as the Fisher and PCA. In this research, a hybrid evolutionary algorit...
متن کاملA packetization and variable bitrate interframe compression scheme for vector quantizer-based distributed speech recognition
We propose a novel packetization and variable bitrate compression scheme for DSR source coding, based on the Group of Pictures concept from video coding. The proposed algorithm simultaneously packetizes and further compresses source coded features using the high interframe correlation of speech, and is compatible with a variety of VQ-based DSR source coders. The algorithm approximates vector qu...
متن کاملNoise reduction for noise robust feature extraction for distributed speech recognition
This paper describes the noise robust feature extraction meth ods developed by France Telecom and Alcatel for the noise robust front-end standardisation of ETSI Aurora. It is shown that both noise reduction methods give a substantial im provement when compared to a standard MFCC feature ex traction algorithm for speech recognition in noisy environ ments. In addition, blind equalisation and ...
متن کاملA Low-resource, Miniature of the Etsi Distributed Speech R
The purpose of this work is to demonstrate that distributed speech recognition front-ends can be deployed in environments which providefor very little power and CPU resources, with possibly no degradation of speech recognition quality when compared to standard floatingpoint implementations. The ETSI distributed speech recognition front-end standard is implemented on an ultra low-power miniature...
متن کامل